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Abstract 

We examine the complexity of the online Dictionary Matching with One Gap Problem 
(DMOG) which is the following. Preprocess a dictionary D oi d patterns, where each pat¬ 
tern contains a special gap symbol that can match any string, so that given a text that arrives 
online, a character at a time, we can report all of the patterns from D that are suffixes of the 
text that has arrived so far, before the next character arrives. In more general versions the gap 
symbols are associated with bounds determining the possible lengths of matching strings. 

Finding efficient algorithmic solutions for (online) DMOG has proven to be a difficult al¬ 
gorithmic challenge. Little progress has been made on this problem and to date and no truly 
efficient solutions are known. However, the need for efficient practical solutions has been on a 
rise since the online DMOG is a bottleneck procedure in the realm of cyber-security, as many 
digital signatures of viruses manifest themselves as patterns with a gap. Indeed, this paper was 
invoked by such a challenge. 

We first demonstrate that the difficulty in obtaining efficient solutions for the DMOG problem 
even, in the offline setting, can be traced back to the infamous 3SUM conjecture, by showing 
that an efficient solution for DMOG implies that the 3SUM conjecture is false. Interestingly, 
our reduction deviates from the known reduction paths that follow from 3SUM. In particular, 
most reductions from 3SUM go through the set-disjointness problem, which corresponds to the 
problem of preprocessing a graph to answer edge-triangles queries: given an edge, report all 
of the triangles containing this edge. We use a new path of reductions by considering the 
complementary, although structurally very different, vertex-triangles queries: given an vertex, 
report all of the triangles containing this vertex. Using this new path we show a conditional 
lower bound of D((5 (Gd) -I- op) time per text character, where Gd is a bipartite graph that 
captures the structure of D, 5{Gd) is the degeneracy of this graph, and op is the output size. 
Moreover, we show a conditional lower bound in terms of the magnitude of gaps for the bounded 
case, thereby showing that some known upper bounds are optimal. 

We also provide matching upper-bounds (up to sub-polynomial factors) for the vertex- 
triangles problem, and then extend these techniques to the online DMOG problem. In par¬ 
ticular, we introduce algorithms whose time cost depends linearly on S{Gd)- Our algorithms 
make use of graph orientations, together with some additional techniques. These algorithms 
are of practical interest since although 5(Gd) can be as large as \fd, and even larger if is 
a multi-graph, it is typically a very small constant in practice. Finally, when 5{Gd) is large we 
are able to obtain even more efficient solutions. 


1 Introduction 


Understanding the computational limitations of algorithmic problems can often lead to algorithms 
for such problems that work well in practice. This paper, which stemmed from an industrial-acdemic 
connection m, is a prime example of where proving algorithmic lower bounds lead to efficient 
practical algorithmic upper bounds. We focus on an aspect of Cyber-security which is a critical 
modern challenge. Network intrusion detection systems (NIDS) perform protocol analysis, content 
searching and content matching, in order to detect harmful software. Such malware may appear 
non-contiguously, scattered across several packets, which necessitates matching gapped patterns. A 
gapped pattern P is one of the form Pi {ai, /3i} P 2 {a 2 , /32} • • • {cKfc, fdk} Pk+i-, where each subpattern 
Pj is a string over alphabet S, and {aj^fdj} matches any substring of length at least aj and at most 
13j. Gapped patterns considered in NIDS systems typically have only one gap. Though the gapped 
pattern matching problem arose over 20 years ago in computational biology applications [281 do] 
and has been revisited many times in the intervening years (e.g. [271 El EH El HH |29l 132]), in 
this paper we study what is apparently a mild generalization of the problem that has nonetheless 
resisted many researcher’s attempts at finding a definitive efficient solution. 

The set of d patterns to be detected, called a dictionary, could be quite large. While dictionary 
matching is well studied (see, e.g. la IS ttn 0 E]), NIDS applications motivate the dictionary 
matching with one gap problem, defined formally as follows. 

Definition 1. The Dictionary Matching with One Gap Problem (DMOG), is: 

Input: A text T of length \T\ over alphabet S, and a dictionary D of d gapped patterns 

Pi,... ,Pd over alphabet S where each pattern has at most one gap. 

Output: All locations in T where a pattern Pi G D, 1 < i < d, ends. 

In the offline DMOG problem T and D are presented all at once. We study the more practical 
online DMOG problem. The dictionary D can be preprocessed in advance, resulting in a data 
structure. Given this data structure the text T is presented one character at a time, and when a 
character arrives the subset of patterns with a match ending at this character should be reported 
before the next character arrives. Three cost measures are of interest: a preprocessing time, a time 
per character, and a time per match reported. Online DMOG is a serious bottleneck for NIDS, 
and has received much attention from both the industry and the academic community. 

Previous Work. Finding efficient solutions for DMOG has proven to be a difficult algorithmic 
challenge as, unfortunately, little progress has been obtained on this problem even though many 
researchers in the pattern matching community and the industry have tackled it. Tabled] describes 
a summary and comparison of previous work. It illustrates that previous results are inadequate for 
NIDS, on the one hand, and that our upper bounds are basically optimal, on the other hand. 

1.1 Our Results 

The DMOG problem has several natural parameters, e.g., \D\, d, and the magnitude of the gap. We 
establish almost sharp upper and lower bounds for the cases of unbounded gaps (a = 0,/3 = 00 ), 
uniformly bounded gaps where all patterns have the same bounds on their gap, and the most 
general non-uniform gaps version. We show that the complexity of DMOG actually depends on a 
“hidden” parameter that is a function of the structure of the gapped patterns. The dictionary D 
can be represented as a graph Go, which is a multi-graph in the non-uniformly bounded gaps case. 
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Preprocessing 

Time 

Total Query Time 

Algorithm 

Type 

Remark 

m 

none 

0{\T\ + \D\) 

online 

reports only 
first occurrence 

m 

0(1^1) 

0{\T\+d) 

online 

reports only 
first occurrence 

m 

0(1^1) 

0{\T\ ■ Isc socc) 

online 

reports one occurrence 
per pattern and location 

[6] 

om) 

0{\T\{I3 - a) -1- op) 

offline 

DMOG 

m 

Oi\D\) 

0(|r|(/3*-a‘) + op) 

offline 

DMOG 

This 

paper 

0{\D\) 

d{\T\-S{GD)-lsc + op) 

online 

DMOG 

This 

paper 

Oi\D\) 

Oi\D\) 

n{\T\ ■ S{Gd)^-°'-^> + op) 
n{\T\ ■ [13 - + op) 

online 

or offline 

DMOG 


Table 1: Comparison of previous work and some new results. The parameters: Isc is the longest suffix chain of 
subpatterns in D, socc is the number of subpatterns occurrences in T, op is the number of pattern occurrences in T, 
a* and P* are the minimum left and maximum right gap borders in the non-uniformly bounded case, S{Gd) is the 
degeneracy of the graph Gd representing dictionary D. 


where vertices correspond to first or second subpatterns and edges correspond to patterns. We use 
the notion of graph degeneracy 6 {Gd) which is defined as follow. The degeneracy of an undirected 
graph G = (F, E) is 6{G) = max[/cT niin^g;/ dcuiu), where dc^ is the degree of u in the subgraph 
of G induced hy U. In words, the degeneracy of G is the largest minimum degree of any subgraph 
of G. It is straightforward to see that a non-multi graph G with m edges has 6{G) = 0{y/rn), and 
a clique has 5{G) = 0(yTu). The degeneracy of a multi-graph can be much higher. 

Vertex-triangle queries. A key component in understanding both the upper and lower bounds 
for DMOG is the vertex-triangles problem, where the goal is to preprocess a graph so that given a 
query vertex u we may list all triangles that contain u. The vertex-triangles problem, besides being 
a natural graph problem, is of particular interest here since, as will be demonstrated in Section [21 
it is reducible to DMOG. Our reduction demonstrates that the complexity of the DMOG problem 
already emerges when all patterns are of the form of two characters separated by an unbounded 
gap. This simplified online DMOG problem is equivalent to the following Induced Subgraph (ISG) 
problem. Preprocess a directed graph G = {V, E) such that given a sequence of vertices online, 
after vertex vi all edges {vj,Vi) E E with j < i are reported. Thus, characters and gapped patterns 
in DMOG correspond to vertices and edges in ISG, respectively. We show that vertex-triangles 
queries are reducible to ISG. 

This reduction serves two purposes. First, in Section [2| we prove a conditional lower bound 
(GLB) for DMOG based on the 3SUM conjecture by combining a straightforward reduction from 
triangle enumeration to the vertex-triangles problem with our new reduction from the vertex- 
triangles problem to DMOG. Our lower bound states that any online DMOG algorithm with low 
preprocessing and reporting costs must spend per character, assuming the 3SUM 

conjecture. Interestingly, the path for proving this GLB deviates from the common conceptual 
paradigms for proving lower bounds conditioned on the 3SUM conjecture, and is of independent 
interest. We provide an overview of this method in Section [1.21 and the details appear in Section [2j 
Moreover, our GLB holds for the ojfline case as well, and can be rephrased in terms of other 
parameters. For example, in the DMOG problem with uniform gaps {q;,/ 3}, we prove that the per 
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character cost of scanning T must be This gives some indication that some recent 

algorithms for the offline version of DMOG problem are almost optimal ([6l I2n|). 

Second, in Section [3] we provide optimal solutions, up to subpolynomial factors, for ISG and, 
therefore, also for vertex-triangles queries, with 0{\E\) preprocessing time and 0{5{G) + op) time 
per each vertex, where op is the size of the output due to the vertex. The connection between ISG 
and DMOG led us to extend the techniques used to solve ISG, combine them with additional ideas 
and techniques, thereby introduce several new online DMOG algorithms whose dependence on 6{G) 
is linear. Thus, graph degeneracy seems to capture the intrinsic complexity of the problem. On the 
other hand, the statement of our general algorithmic results is actually a bit more complicated as it 
depends on other parameters of the input, namely Isc, the longest suffix chain in the dictionary, i.e., 
the longest sequence of dictionary subpatterns such that each is a proper suffix of the next. While 
the parameter I sc could theoretically be as large as d, in practice it is very small m- Nevertheless, 
we also present algorithms that in the most dense cases reduce the dependence on Isc. We provide 
an overview of our algorithms in Section 11.31 and the details appear in Sections [3l [H and [5l 

Louver Bounds Leading to Practical Upper Bounds. After trying to tackle the DMOG 
problem from the upper bound perspective, we suspected that a lower bound could be proven, 
and indeed were successful in showing a connection to the 3SUM conjecture. The CLB proof 
provides insight for the inherent difficulty in solving DMOG, but is also unfortunate news for those 
attempting to find efficient upper bounds. 

Fortunately, after a careful examination of the reduction from 3SUM to DMOG we noticed that 
the GLB from the 3SUM conjecture can be phrased in terms of 6{Gd), which turns out to be a 
small constant in the input instances considered by NIDS. This lead to designing algorithms whose 
runtime can be expressed in terms of 8{Gd), and can therefore be helpful in practical settings. 

1.2 The Lower Bound Story 

Solving DMOG with poly-logarithmic time bounds seems to be an extremely difficult task, leading 
to the question of finding a polynomial time lower bound. Polynomial (unconditional) lower bounds 
for data structure problems are considered beyond the reach of current techniques. Thus, it has 
recently become extremely popular to prove GLBs based on the conjectured hardness of some 
problem. One of the most popular conjectures for GLBs is that the 3SUM problem (given n 
integers determine if any three sum to zero) cannot be solved in truly subquadratic time, where 
truly subquadratic time is 0(n^“^^^^) time. This conjecture holds even if the the algorithm is 
allowed to use randomization (see e.g. [301 E [221 US]). In Section [2| we show that the infamous 
3SUM problem can be reduced to DMOG, which sheds some light on the difficulty of the DMOG 
problem. Interestingly, our reduction does not follow the common paradigm for proving GLBs based 
on the 3SUM conjecture, providing a new approach for reductions from 3SUM. This approach is of 
independent interest, and is described next. 

Triangles. Patra§cu [30] showed that 3SUM can be reduced to enumerating triangles in a tripar¬ 
tite graph. Kopelowitz, Pettie, and Porat |22j provided more efficient reductions, thereby showing 
that many known triangle enumeration algorithms ([211 da [inii23]) are essentially and condition¬ 
ally optimal, up to subpolynomial factors. Hence, the offline version of triangle enumeration is 
well understood. The following two indexing versions of the triangle enumeration problem are a 
natural extension of the offline problem. In the edge-triangles problem the goal is to preprocess 
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a graph so that given a query edge e all triangles containing e are listed. The vertex-triangles 
problem is defined above. Clearly, both these versions solve the triangle enumeration problem, 
which immediately gives lower bounds conditioned on the 3SUM conjecture. 

It is fairly straightforward to see that the edge-triangles problem on a tripartite graph cor¬ 
responds to preprocessing a family F of sets over a universe U in order to support set intersec¬ 
tion queries in which given two sets S,S' G F the goal is to enumerate the elements in 5 n S'' 
(see [22] )• Indeed, the task of preprocessing F to support set-intersection enumeration queries, 
and hence edge-triangles, is well studied [mils]. Furthermore, the set intersection problem has 
been used extensively as a tool for proving that many algorithmic problems are as hard as solving 
3SUM [301[I1[22|- However, the vertex-triangles problem has yet to be considered directl30. 

We use the vertex-triangles problem in order to show that the ISG problem is hard, and thus 
the simplest DMOG version of (offline) unbounded setting is 3SUM-hard. The following theorems 
are simplified statements of the ones proven in Section [21 

Theorem 1. Assume 3SUM requires expected time. For any algorithm that solves the 

vertex-triangles problem on a graph G with m edges, if the amortized expected preprocessing time is 
0{m ■ and the amortized expected reporting time is sub-polynomial, then the amortized 

expected query time must be at least n{{d ■ 6{G))^~°^^^), where d is the degree of the queried vertex. 

Theorem 2. Assume 3SUM requires expected time. For any algorithm that solves 

the ISG problem on a graph G with m edges, if the amortized expected preprocessing time is 
0{m ■ and the amortized expected reporting time is sub-polynomial, then the amortized 

expected time spent on each vertex during a query must he at least II(((5(G))^“°(^)). 

Theorem 3. Assume 3SUM requires expected time. For any algorithm that solves 

the DMOG problem on a graph G with m edges, if the amortized expected preprocessing time is 
0(|Z?| and the amortized expected reporting time is sub-polynomial, then the amortized 
expected time spent on each text character must be at least Q{{5{G■ 

Theorem 4. Assume 3SUM requires expected time. For any algorithm that solves the 

uniformly bounded DMOG problem on a graph G with m edges, if the amortized expected prepro¬ 
cessing time is 0{\D\ ■ 5{Gand the amortized expected reporting time is sub-polynomial, 
then the amortized expected time spent on each text character must be at least n((/3 — 

A Note on Triangle Reporting Problems and Other Popular Conjectures. Many CLBs 
based on other popular conjectures, such as the Boolean Matrix Multiplication conjecture or the 
Online Matrix Vector Multiplication conjecture, use reductions from set-disjointness and hence 
from edge-triangles queries (see mm)- However, it is not clear how to obtain meaningful lower 
bounds for vertex-triangles queries based on these conjectures. These difficulties are discussed in 
Appendix El 

1.3 The Upper Bound Story 

Given Theorems[Tl[2l and[3l we focus on providing matching upper bounds for the online versions of 
the vertex-triangles problem, ISG, and DMOG. A table summarizing our upper-bounds for DMOG 

^The closely related problem of deciding whether a given vertex is contained by any triangle (a decision version) 
has been addressed [7]. 
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appears in Appendix [Bj In Section [3l we provide a solution for ISG using 0{m) preprocessing time 
and 0{5{G) + op) query time using the graph orientation technique, which is closely related to 
graph degeneracy. This matches the CLB from Theorem [21 and by applying the reduction from 
vertex-triangles queries to ISG, it also matches the GLB from Theorem [H 

Our ISG algorithms are then extended to versions corresponding to the simplified versions of 
the uniformly and non-uniformly bounded DMOG problems. This is shown in Sections 13. II and 13.21 
The time bounds, ignoring poly-log factors, remain the same, however, the space usage is slightly 
increased. Interestingly, for the non-uniform case we utilize 4-sided 2-dimensional orthogonal range 
reporting queries in a clever way. 

In Section HI the ISG algorithms are extended to solutions for the various DMOG versions. 
However, the longer subpatterns introduce new challenges that need to be tackled. First, since 
subpatterns can be suffixes of each other, up to Isc vertices can arrive simultaneously in which 
is the backbone of our algorithms. Thus, the time of our algorithms have a multiplicative factor 
of Isc. We emphasize that we are not the first to introduce the Isc factor in solutions for DMOG 
problems m- Second, since subpatterns may be long, we must accommodate a delay in the time 
a vertex corresponding to a second subpattern is treated as if it has arrived, thus inducing a minor 
additive space usage. 

Finally, the online DMOG algorithms in SectionHjhave a cost per character of 0{Isc-6{Gd)+op). 
In Section [5| we show that it is possible to obtain more efficient bounds that depend linearly 

on y/lsc ■ d when 6{Dg) > by first considering special types of graph orientations, called 

threshold orientations, and then carefully applying data-structure techniques. Notice that while 
in the uniformly bounded case we have 6{Gd) = 0{Vd), in the non-uniform case d{GD) could be 
much higher and so these new algorithms become a vast improvement. However these algorithms 
are mostly of interest from a theoretical perspective since in practice both the degeneracy and Isc 
are very small. 

2 3SUM: Conditional Lower Bounds 

In this section we prove that conditioned on the 3SUM conjecture we can prove lower bounds for 
the vertex-triangles problem, the ISG problem, and the (offline) unbounded DMOG problem. Since 
the other two versions of DMOG (uniformly and non-uniformly bounded) can solve the unbounded 
DMOG version, the lower bounds hold for those problems as well. We use the following theorem 
proven by Kopelowitz, Pettie, and Porat |22j . 

Theorem 5 ([22]). Assume 3SUM requires D(n^“°^^^) expected time. Then for any constant 0 < 
X < 1/2, any algorithm for enumerating all triangles in a graph G with m edges, 0(m^“*) vertices, 
and d = 6{G) = 0(m*), where d is the average degree of a vertex in G, must spend H(m-5(G)^“°*^^^) 
expected time. 

Proof of Theorem H] We reduce the triangle enumeration problem considered in Theorem [5] to the 
vertex-triangles problem. We preprocessing G and then answer vertex-triangles queries on each of 
the vertices thereby enumerating all of the triangles in G. If we assume a sub-polynomial 

reporting time, then by Theorem [5| either the preprocessing takes H(m • 5(G)^“°^^^) time or each 
query must cost at least ^ ^ = l}{{d ■ 6{G))^~°^^'>) time. □ 
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Proof of Theorem [H and Theorem 0 We reduce the vertex-triangles problem considered in Theo- 
rem[T]to ISG as follows. We preprocess the graph G for ISG queries. Now, when we want to answer 
a vertex-triangle query on some vertex u, we input all of the neighbors of u into the ISG algorithm. 
Thus, there is a one-to-one correspondence between the edges reported by the ISG algorithm and 
the triangles in the output of the vertex-triangles query. Since each vertex-triangle query must cost 
r 2 (d • amortized expected time then the amortized expected time spent for each of the 

d neighbors of u must be at least amortized expected time. Since ISG is a special 

case of DMOG, and given Theorem [21 the proof of Theorem [3] follows directly. □ 

Proof of Theorem\^ The proof is similar to the proofs of Theorems [2]and[3l First, we convert the 
input graph G of the vertex-triangles problem to a tripartite graph Gt by creating three copies of 
the vertices Vi, V 2 ) V 3 and for each edge (tt, v) in G we add 6 edges to Gt between all possible copies 
of u and v. We also add a dummy vertex to Gt with degree 0. Each triangle in G corresponds 
to a constant number of triangles in Gt- Let a be any positive integer and let /d = a + 2d. We 
use ISG to solve vertex-triangles queries in Theorem [21 but we only ask queries on the neighbors of 
vertices in Vi in a specially tailored way as follows. We first list the neighbors of u from V 2 , followed 
by a copies of the dummy vertex, and then list the neighbors from V 3 . Prom the construction of 
the tripartite graph and the input to the ISG algorithm, two vertices of an edge that is part of 
the output of the ISG algorithm must be separated in the input list by at least a vertices, and 
by at most the length of the list which is fd. Thus, the time spent on each vertex must be at 
least amortized expected time. Gontinuing the 

reduction to the bounded DMOG problem completes the proof. □ 

3 The Induced Subgraph Problem 


An Upper Bound via Graph Orientations. We make use of graph orientations, where the 
goal is to orient the graph edges while providing some guarantee on the out-degrees of the vertices. 
Formally, an orientation of an undirected graph G = {V, E) is called a c-orientation if every 
vertex has out-degree at most c > 1. The notion of graph degeneracy is closely related to graph 
orientations [3]. There is a simple linear time greedy algorithm by Chiba and Nishizeki [T2] that 
assigns a (5(G)-orientation of G. We use graph orientations for solving ISG problem as follows. 
First, we view a c-orientation as assigning “responsibility” for all data transfers occurring on an 
edge to one of its endpoints, depending on the direction of the edge in the orientation (regardless 
of the actual direction of the edge in the input graph G). We exploit this distinction by using the 
notation of an edge e = (u, v) as oriented from u to v, while e could be directed either from u to 
V or from v to u. We say that u is responsible for e, and that e is assigned to u. Furthermore, u 
is a responsible-neighbor of v and v is an assigned-neighbor of u. Notice that in a c-orientation the 
number of assigned-neighbors of any vertex is at most c, while the number of responsible-neighbors 
could be much larger than c. 

The Bipartite Graph. We begin by converting G = (V, E) to a bipartite graph by creating two 
copies of V called L (the left vertices) and R (the right vertices). For every edge (u,v) G E we 
add an edge in the bipartite graph from ul G L to vr G R, where is a copy of u and vr is a 
copy of V . All of the edges are directed from L to R. Furthermore, each vertex in V that arrives 
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during query time is replaced by its two copies, first the copy from R and then the copy from L. 
This ordering guarantees that a self loop in G is not mistakenly reported the first time its single 
vertex arrives. Notice that the degeneracy of G is unchanged, up to constant factors, due to this 
reduction. From here onwards we assume that G is already in this bipartite representation. 

The Data Structure. We preprocess G using the algorithm of |12] . thereby obtaining a c- 
orientation with c = S{G). Each vertex v & R maintains a reporting list C^, which is a linked list 
containing its responsible-neighbors that have already appeared during the current query. When a 
vertex u G i? arrives during query time, the elements in Cy correspond to (some of the) edges that 
should be reported. Thus, reporting each such edge takes constant time via a linear scanning of 
Cy. The remaining edges to be reported are all assigned to u, so scanning the edges assigned to 
u in 0{6{G)) time suffices for listing them. When a vertex u £ L arrives, u is marked as arrived 
and we scan its assigned-neighbors, adding u to their reporting lists. This also takes 0{5{G)) time. 
Thus, we have proven Theorem [6l 

Theorem 6. The ISG problem on a graph G with m edges and n vertices can be solved online with 
0{m + n) preprocessing time, 0{5{G) + op) time per query vertex, where op is the number of edges 
reported at vertex arrival, and 0{m) space. 

3.1 Uniformly Bounded Edge Occurrences 

In this case, the ISG problem is restricted with two positive integer parameters a and fd so that 
an edge {vj,Vi) can only be reported if a<i — j<l3. We still assume that the graph is bipartite, 
as before. The time window between j3 time units ago and a time units ago is called the active 
window. A reporting list Cy for a vertex v G R contains the responsible-neighbors of v which have 
appeared during the active window, without repetition. Since each responsible neighbor appears 
only once, the space consumption of all of these lists is 0{m). Furthermore, each vertex u £ L 
maintains an ordered list of time stamps Ty of the times u appeared in the current active window. 

Notice that the arrival of a vertex from R should be treated immediately, while the treatment 
of the arrival of a vertex from L should be delayed by a time units. Second, an algorithmic 
mechanism is needed for “forgetting” the arrival of a vertex after /3 time units. These are addressed 
by maintaining a list of the last fd vertices that have arrived during query time. The complete proof 
of Theorem [7] is given in Appendix [Cl 

Theorem 7. The Induced Subgraph problem with uniformly bounded edge occurrences on a graph 
G with m edges and n vertices can be solved with 0{m + n) preprocessing time, 0{5{G) + op) time 
per query vertex, where op is the number of edges reported at vertex arrival, and 0{m -|- fd) space. 

3.2 Non-Uniformly Bounded Edge Occurrences 

We now consider the generalization to non-uniformly bounded edge occurrences, where each edge 
e = {vj,Vi) has its own boundaries [oe, fde] and can only be reported if Oe < i — j < /de- Notice that 
in this case the input graph is a multi-graph. The active window for this ISG version is the time 
window between ft* = maXe6_E{/3e} and a* = mine 6 £;{Q:e} time units ago. 

Similar to Section EH a list of the last /d* vertices that have appeared is maintained so that a 
vertex can be removed from our data structure after fd* units of time. 
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4 DMOG via Graph Orientation 


When extending ISG to online DMOG, the longer subpatterns introduce new challenges that need 
to be addressed. It is helpful to still consider the bipartite graph presentation of the DMOG 
instance, where vertices correspond to subpatterns and edges correspond to patterns. We use the 
algorithms from Section [3] as basic building blocks in our algorithms for DMOG by treating a 
subpattern arriving as the vertex arriving in the appropriate graph. However, we now need to 
address the difficulties that arise from subpatterns being arbitrarily long strings. 

To start off, we need a mechanism for determining when a subpattern arrives. One way of 
doing this is by using the the Aho-Corasick (AG) Automaton [ 2 ], using a standard binary encoding 
technique so that each character costs 0(log |S|) worst-case time. For simplicity we assume that |S| 
is constant. However, while in the ISG problem each character corresponds to the arrival of at most 
one subpattern, in the DMOG with unbounded gaps each arriving character may correspond to 
several subpatterns which all arrive at once, since a subpattern could be a proper suffix of another 
subpattern. To address this issue we phrase the complexities of our algorithms in terms of Isc which 
is the maximum number of vertices in the bipartite graph that arrive due to a character arrival. 
This induces a multiplicative overhead of at most Isc in the query time per text character relative 
to the time used by the ISG algorithms. 

Finally, there is an issue arising from subpatterns no longer being of length one, which for 
simplicity we first discuss this in the unbounded case. When u G L arrives and it has an assigned 
vertex v & R where is the length of the subpattern associated with v, then we do not want 
to report the edge {u, v) until at least — 1 time units have passed, since the appearance of the 
subpattern of v should not overlap with the appearance of the subpattern of u. Similarly, in the 
bounded case, we must delay the removal of u from by at least — 1 time units. Notice that 
if we remove u from Cy after a delay of niy — 1, then we may be forced to remove a large number 
of such vertices at a given time, which may be unaffordable within our time bounds. Our solution 
is to delay the removal of u by M — 1 time units, where M is the length of the longest subpattern 
that corresponds to a vertex in R. This solves the issue of synchronization, however now there is a 
problem of some of the reporting lists being elements that should not be reported. Nevertheless, it 
is straightforward to spend time in a reporting list that corresponds to the size of the output using 
standard list and pointer techniques. Combining these ideas with the algorithms in Section [3] gives 
Theorems [ 8 l [9] and [IHl 

Theorem 8. The DMOG problem with one gap and unbounded gap borders can be solved with 
0{\D\) preprocessing time, 0{5 {Gd) ■ lsc + op) time per text character, where op is the number of 
patterns that are reported due to the character arriving, and 0{\D\) space. 

Theorem 9. The DMOG problem with uniformly bounded gap borders can be solved such that 
dictionary patterns are reported online in: 0{\D\) preprocessing time, 0{5 {Gd) -Isc + op) time per 
text character, where op is the number of patterns that are reported due to the character arriving, 
and 0{\D\ + Isc ■ {/3 — a + M) -|- a) space. 

Theorem 10. The DMOG problem with non-uniformly bounded gap borders can be solved such that 
dictionary patterns are reported online in: 0{\D\) preprocessing time, 0{5 {Gd) -Isc + op) time per 
text character, where op is the number of patterns that are reported due to the character arriving, 
and 0{\D\ + Isc ■ 5{GD){ld* — a* -|- M) + a*) space. 


5 DMOG via Threshold Orientations 


Sections [3] and m focus on orientations whose out-degree is bounded by 6{Gd)- Thus, when 6{Gd) = 
'/d the DMOG algorithms take 0{lsc-yfd) time. This is exacerbated in the non-uniform case where 
the degeneracy can be much larger, since the input graph is a multi-graph. In this section we show 
how in such dense inputs we can reduce the dependence on Isc ■ 6 {Gd) to y/lsc ■ d, by using a 
different method for orienting the graph which we refer to as a threshold orientation. 

Definition 2. A vertex in Go is heavy if it has more than sj d/lsc neighbors, and light otherwise. 

Our algorithms use two key properties. The first is that light vertices have at most d/lsc 
neighbors, and the second is that the number of heavy vertices is less than d/lsc. As in the 
previous algorithms, for each vertex u G i? we maintain either a reporting list in the uniform 
case or the data structure 5^ in the non-uniform case, and for each vertex u G T we maintain the 
list Tu- These structures enable dealing with edges where at least one of its endpoints is light, and 
so we orient all edges that touch a light vertex to leave that vertex, breaking ties arbitrarily if both 
vertices are light. In particular, if a vertex (heavy or light) u G L arrives then it adds the arrival to 
Tu, and if u is light then it updates the data-structures in all of its out-going neighbors in R. Since 
there are at most Isc vertices arriving at a time, this costs at most 0{lsc+Vlsc ■ d) time. If a vertex 
(heavy or light) u G i? arrives then it reports relevant information from its responsible-neighbors, 
which must all be light vertices, in time proportional to the output size, and if v is light then it 
can afford to scan all of its assigned-neighbors. Since there are at most Isc vertices arriving at a 
time, this costs at most 0{lsc + Vise ■ d + op) time. 

The remaining task is reporting edges between two heavy vertices. Prom a very high level, we 
will leverage the fact that the number of heavy vertices is less than Vise ■ d, and so even if the 
number of vertices from L that arrive at the same time can be as large as Isc and the number 
of neighbors of each such vertex can be very large, the number of vertices in R is still less than 
Vise ■ d. So using a batched scan on all of R will keep the time cost low. We show that after some 
preprocessing such a scan can produce the desired result. 

The tree structure. Out treatment of heavy vertices uses a special tree-like structure T among 
the subpatterns associated with vertices from L. In particular, the vertices of T are the 0{Vlsc ■ d) 
vertices from L, where a vertex u is an ancestor of a vertex v if and only if the subpattern associated 
with u is a suffix of the subpattern associated with v. To make the structure an actual tree we also 
add an additional special vertex corresponding to the empty string as the root of T since it is a 
suffix of every subpattern. The tree T can be constructed in linear time from the AC automaton 
of D. Notice that the depth of the tree is Isc. Furthermore, the graph vertices arriving due to a 
text character arrival correspond to all of the vertices on some path from the root of T to some 
vertex u, not including the root. This means that the arrival of u implies the arrival of all of its 
at most Isc ancestors, not including the root. We emphasize that the AC automaton mechanics 
allow one to report only one vertex u such that u and all of its ancestors exactly correspond to the 
subpatterns that have arrived (implicitly). 

We briefly overview the algorithm for the uniformly bounded case. The algorithm for the non- 
uniformly bounded case have a similar flavor. Let A: = |i?| and let R = {vi,V 2 ,... ,vV\. For each 
vertex u G L let Au be an array of size k where the Fth location in Au, denoted by Au[i], is a 
pointer to a list of all of the edges that connect an ancestor of u in T (which may be u) to Uj. 
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When u £ L arrives then for each pointer Ay\i] 7^ null we would like to add Ay\i] to the beginning 
of Cv^. Recall that if Ay[i\ was already in the reporting list then we remove the other older 
copy of Au[i] from Cy^, in order to keep the space usage linear (as in Section [3]). When a vertex 
Vi G R arrives we scan Cy^ as in Section IH and for each edge {u,Vi) in a Cy^ that has appeared 
in the appropriate time frame (since we need to skip the tail of Cy as in Section 2]) we scan the 
list Ty (which is still maintained as before) and report all of the appropriate edges. Finally, after 
/3 — a + M + 1 time units have passed since the last time u G L has updated some reporting lists, 
we remove all of the Ay\i] 7^ null from each Cy^. The rest of the implementation details discuss 
how the array Ay can be computed efficiently during the query phase, with low preprocessing time 
and space. 

The following Theorems are proven in Appendix iDl 

Theorem 11. The DMOGproblem with one gap and uniform gap borders can he solved with 0{\D\) 
preprocessing time, 0{lsc + y/lsc ■ d + op) time per text character, and 0{\D\ + lsc{/3 — a + M) + a) 
space. 

Theorem 12. The DMOG problem with one gap and non-uniform gap borders can be solved with 
0{\D\ + d{f3* — a*)) preprocessing time, 0{lsc + Vise ■ d{f3* — a* + M) + op) time per query text 
character, and 0{\D\ + d(/3* — a*) + y/lsc ■ d{(3* — a* -\- M) + a*) space. 
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Appendix 

A Triangle Reporting Problems and Other Popular Conjectures. 


As noted , many of the lower bounds conditioned on the 3SUM conjecture are proven via a series of 
reductions that pass through either set disjointness queries or set intersection queries (see IMl 11122]), 
as is the case with the triangle enumeration conditional lower bounds, which as stated lead to lower 
bounds for both edge-triangle queries and vertex-triangles queries. Since edge-triangle queries can 
be used to solve set disjointness, another natural candidate for a conjecture with which we can 
prove that edge-triangle queries are hard is the Boolean Matrix Multiplication (BMM) conjecture, 
which states that no combinatorial algorithnJl exists for BMM on two n x n matrices. 

This is because the answer to a set disjointness query corresponds to the inner product of the 
characteristic vectors of the two sets, and each entry in the output of BMM is the inner product of 
one row and one vector from the input matrices. Notice that this approach derives a conditional 
lower bound from the BMM conjecture for the decision version of edge-triangle queries (does there 
exist a triangle containing the query edge), which can be solved via the reporting version considered 
above if we stop the query process after the first triangle is reported, thereby obtaining a conditional 
lower bound for the reporting version itself. 

Just like the decision version of edge-triangle queries corresponds to the inner product of two 
boolean vectors, we can show that vertex-triangles queries correspond to the outer product of two 
vectors. However, outer products are too weak of a tool for proving conditional lower bounds from 
BMM, since the output of the outer product of two vectors of length n is n?, and in order to 
solve BMM using outer products we need to consider n pairs of vectors and their outer products, 
resulting in Q{n^) information which is already too much. 

Another candidate for proving the hardness of edge-triangle queries is the recent Online Matrix 
Vector (OMV) multiplication conjecture which states that there is no 0(n^“^(^^) algorithm for 
multiplying an n x n matrix with n vectors of length n each, where the vectors arrive online and 
the output of the Ah multiplication must be given prior to the arrival of the (i + l)th vector. Since 
multiplying a matrix with a vector corresponds can be solved via inner products, the connection 
to edge-triangle queries is clear. However, it is not clear how to use the OMV conjecture to prove 
some hardness on outer products, and so it is not clear if this conjecture can be used to prove the 
hardness of vertex-triangles queries, ISO, and DMOG. 


^There is no clear definition of a combinatorial algorithm, and the notion that is accepted by the algorithmic 
community is that the way to establish if an algorithm is combinatorial or not is done by just looking at it. 


13 



B Table of Upper Bounds 


Gaps 

Type 

Preprocessing 

Time 

Query Time 
per Text Character 

Space 

unbounded 

0{\D\) 

0{5{Gd) ■ Isc -t op) 

om) 

uniform 

bounds 

0{\D\) 

0{5{Gd) ■ Isc + op) 

0{\D\ + lsc{l3 — a + M) + a) 

non-uniform 

bounds 

0{\D\) 

0{5{Gd) ■ Isc + op) 

d{\D\ + lsc{P* -a* +M) + a*) 

uniform 

bounds 

0{\D\) 

0{lsc + ^/isc ■ d + op) 

0{\D\ + lsc{l3 — a -1- M) + a) 

non-uniform 

bounds 

0{\D\+d{f5*-a*)) 

0{Vlsc ■ d{P* — a* + M) + op) 

0{\D\ + d{p* - a*) 

+ -a* + M) + a*) 


Table 2; A summary of upper bounds for DMOG described in this paper. 


C Full Details for Section [3] 

Proof of Theorem\^ The process for treating the arrival of Vi is as follows. First, we remove Vi-js-i 
from the list of the last /3 vertices, and if E L then we remove the time stamp of i — /3 — 1 

from If becomes empty then we remove Vi-p-i from all of the reporting lists at the 

assigned-neighbors of Uj_, 3 _i. Next, if Vi-a E L then we treat it as if it just arrived. To do this, 
if Tvi-a empty before the current arrival, then we add Vi-a to the reporting lists of all of its 
assigned neighbors. We also add z — a to 

Finally, we are ready to treat u*. First, Vi is added to the list of the last /3 vertices. Then, if 
Vi G R then we treat it as if it arrived right now by: (1) accessing the vertices in and reporting 
the edges corresponding to their time stamp lists, and ( 2 ) scanning the assigned-neighbors of Vi to 
output any additional edges. The additional space usage for this part of the algorithm is another 
0(/3) words (for the list of the last /3 vertices), and the time cost per vertex is 0{5{G) + op). □ 

Proof of Theorem ??. Each vertex u £ L maintains as in Section I.S.ll Since not all of the 
occurrences of u are relevant when one of its responsible-neighbors from R has arrived, the cost of 
filtering the list is at most 0(log(/3* — a*) + k) where k is the size of the output. 

Similar to Section EH we maintain a list of the last (5* vertices that have arrived during query 
time. The process for treating the arrival of Vi is as follows. First we remove from the list 

of the last (3 vertices, and if E L then we remove the time stamp of i — /3* — 1 from 

Next, we remove the at most 5{G) points that were created by Vi- 0 *-i from all of the orthogonal 
reporting data-structures at the assigned-neighbors of Notice that we only remove the 

points that were due to the arrival of at time i — j3* — 1 (and not possibly other times). 

Next, if Vi-a* E L then we add i — a* to , and for each assigned-neighbor v of Vi-a* where 

e = {vi-a*,v) we insert the point {i + a^, i -|- /3e) into Sy. 

Finally, we are ready to treat Vi. First Vi is added to the list of the last /3 vertices. Then, if 
Vi G R then we: (1) execute a range reporting query in Sy as described above (looking for all points 
whose first coordinate is at most i and whose second coordinate is at least i) ( 2 ) and scan the at 
most S{G) assigned-neighbors of Vi to output any additional edges using the list of time stamps. 
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The additional space usage for the algorithm is 0{j3*) words for the list of the last /3* vertices 
and all of the lists of time stamps, and another 0{5{G){fi* — a*) log'^/®’''^(J(G)(/3* — a*))) words 
for all of the orthogonal range reporting data structures. The cost of processing a vertex in L that 
arrived either a* or /3* + 1 time units ago is 0{5{G) \o^{6{G){I3* — a*))) time to add/remove 
points into/from orthogonal range reporting structures. The cost of processing a vertex v £ R that 
just arrived, in addition to the size of the current output, is Q( ) time to query Sy 

and 0{6{G) log(/3* — a*)) time for scanning all of the assigned-neighbors of v. □ 

D Full Details for Section [5] 

Proof of Theorem, [Til Let k = |i2| and let R = {ui,U 2 ,... ,Vk}- For each vertex u G L let Ay be 
an array of size k where the i’th location in Ay, denoted by is a pointer to a list of all of 

the edges that connect an ancestor of u in T (which may be u) to Uj. When u £ L arrives then 
for each pointer Ay[i\ ^ null we would like to add Ay[i\ to the beginning of Recall that if 
Ay [z] was already in the reporting list £y. then we remove the other older copy of Ay [z] from Cy ^, 
in order to keep the space usage linear (as in Section [3]) . When a vertex Vi £ R arrives we scan 
as in Section m and for each edge {u,Vi) in a Cy^ that has appeared in the appropriate time frame 
(since we need to skip the tail of Cy as in Section 0]) we scan the list Ty (which is still maintained 
as before) and report all of the appropriate edges. Finally, after /3 — a + M + 1 time units have 
passed since the last time u £ L has updated some reporting lists, we remove all of the Ay\i\ ^ null 
from each Cy^. The rest of the implementation details discuss how the array Ay can be computed 
efficiently during the query phase, with low preprocessing time and space. 

Constructing arrays. We begin by leveraging the tree structure of T where a vertex that arrives 
implies that all of its ancestors arrived as well. For each vertex u £ L and for each vertex Vi £ R 
such that e = {u,Vi) £ Ed we maintain a pointer next{e) to an edge e' = {u',Vi) where u' is the 
lowest proper ancestor of zt in T such that there is an edge from u' to u,. If no such vertex u' exists 
then next{e) = null. It is straightforward to add these pointers in linear time, and their space 
usage is linear. Thus, it is enough for Ay[i\ to be a pointer to e and the list of all of the ancestors of 
zz in r that have edges touching Vi is obtained through the next{ ) pointers. Similarly, if for some 
i! there is no edge {u,Vii) then the entry of Ay\i'\ should point to the edge {u',Vi') where u' is the 
lowest proper ancestor of zz in T such that there is an edge from u' to Vi. If no such edge exists 
then At([z'] = null. In order to quickly find all of the entries of Ay when u arrives we will leverage 
a preprocessing phase. 

Let zz; : L —>• [d] be a weight function such that for zz € L the weight w{u) is the degree of u in 
Gd- Notice that the weight of any vertex is at least since all of the vertices are assumed to 

be heavy. 

The construction of these arrays will make use of the following procedure. Let zz and v be two 
vertices in T where z; is a proper ancestor of zz, and assume that Ay has already been constructed. 
In order to construct Ay we first initialize all of its entries to null. Next we traverse the vertices 
from zz to z; by order, and for each vertex w that we encounter on this path, for each edge {w, Vi), if 
Ay[i] = null then we set Ay[i] to be a pointer to (zz;, Vi). Once we reach v we fill in the null entries 
in Ay with their corresponding entries in Ay (some of which may also be null). The total time cost 
of this process is the total number of edges of all of the vertices on this path (notice that every 
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vertex has weight at least 1) which is the sum of the weights of vertices in this path, and another 
0{Vlsc ■ d) time for initializing and scanning Ay. 

In order to utilize the construction procedure that was just described, we will maintain arrays 

only for specially chosen vertices in T, such that whenever we need to construct an array 

Ay for some vertex u during the query phase where u is not special, the total weight of vertices 
on the path from u to its closest special ancestor will be 0{-s/lsc ■ d), so the total time cost for 
constructing Ay will be 0{-\/lsc ■ d). 


Choosing special vertices. We partition T into 0{y j^) small subtrees such that each subtree 
has total weight Vl{-s/lsc ■ d), except for possibly the subtree containing the root of T. This guaran¬ 
tees that the total number of small subtrees is The special vertices are the roots of these 

small subtrees. 

The partitioning is obtained by (greedily) peeling small subtrees in the bottom of T. Specifically, 
let Tu be the subtree of T rooted at u and let w{Tu) be the total weight of vertices in T„. Then we 
iteratively peel a subtree such that w{Tu) is at least Vise ■ d but the total weight of each subtree 
of a child of u in T is (separately) less than Vise ■ d. This peeling continues until no such subtree 
exists, in which case the remaining subtree must have total weight less than Vise ■ d and it is the 
last small subtree (it also contains the root). It is straightforward to implement the partitioning 
in linear time using a post-order traversal. Notice that by the construction method, for any vertex 
u that is not the root of T, the total weight of vertices on the path from u to its closest proper 
ancestor that is a special vertex is 0{Vlse ■ d). 

To compute the arrays for all of the special vertices we use a top-down approach by hrst 
constructing the array for the root of T and then we compute each Ay for a special vertex u 
only after the array for the closest proper special ancestor u of u was constructed. Using the 
construction procedure above and the property that the total weights of vertices on the path from 
tt to u is 0{Vlse ■ d) the time to construct the array for each special vertex is 0{Vlse ■ d). Since the 


number of special vertices 0{y the total preprocessing phase costs 0{d) time. Similarly, the 

process of constructing the array Ay for a vertex u ^ L that arrived during a query costs 0{Vlse ■ d) 
time. Notice that if u is not a special vertex then we construct A^ temporarily when u arrives, 
use Au in order to update the reporting lists of vertices in R, and then delete Ay. Moreover, if 
P — a + M + 1 time units have passed since the last time u has updated reporting lists, we again 
construct Ay only temporarily in order to delete the information that should be removed. □ 


Proof of TheoremMA As in the uniform case let R = {vi,V 2 , ■ ■ ■ ,Vk}. Each vertex Vi £ R will 
maintain a cyclic aetive window array AWi of size P* — a* + M + 1 where the j’th location in AWt, 
denoted by AITi[j], will be a pointer to a list of lists of edges that all need to be reported if Vi will 
appear in j — 1 time units from now. Since k = 0{Vlse ■ d), every time a vertex arrives during the 
query phase we shift all of the active window arrays by one position. This shift is implemented 
by incrementing the starting position of each active window array in a cyclic manner, costing 0(1) 
time per vertex in R. When a vertex Vi appears, all of the edges pointed to by location AITj[l] 
will be reported, in time proportional to the size of the output. In the remainder of this section we 
focus on how to maintain the active array windows correctly as vertices from L appear during the 
query phase. 
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We again leverage the tree structure of T where a vertex that arrives implies that all of its 
ancestors arrived as well. Recall that for an edge e its gap boundaries are denoted by and /3e- 
For each vertex u £ L and for each vertex Vi £ R such that e = {u, Vi) £ Eq we maintain an array 
nexte of size /3e — c^e + 1- It will be helpful to treat the indices of the array as starting from ae and 
ending at f3e- So for cte < j < f3e, the j’th location in nexte, denoted by nexte[j], points to an edge 
e' = {u',Vi) where u' is the lowest ancestor of tt in T (possibly u itself) such that: 

• There is an edge e' = {u',Vi). 

• OLe' <j< /3e'- 

• It is not possible to reach e by iteratively following nextr[j]- 

If no such edge exists then nexte\j] = null. The total space usage for all of the next pointer arrays is 
p := '^eeEo ~ + 1) < d{j3* — a*). Also, it is straightforward to construct these arrays in 0{p) 

time. Notice that from the way we constructed the next pointer arrays, for a given vertex u £ L 
all of the multi-edges (u,Vi) whose boundaries contain j form a contiguous list when considering 
the set of pointers nexte\j] for each such edge e. We denote the first edge on this contiguous list 
by headu^ij. Let the weight of a node u £ L, denoted by w{u), be the number of pointers of the 
form headu^ij / null. Notice that the total weight of all of the vertices in T is 0{d{a* — /?*)) (as 
opposed to the uniform case where the total weight was 0 {d)). 

For each vertex u £ L and Vi £ R let Wu,i be an array of size /3* — a* + 1. As before, it will 
be helpful to treat the indices of the array as starting from a* and ending at /3*. The j’th location 
in Wu,i, denoted by Wu,i[j], is a pointer to headu^ij thereby giving access to a list of all of the 
edges in Ed that: (1) touch Vi, (2) touch an ancestor of u in T, and (3) their boundaries contain j. 
When u £ L arrives then for each pointer ITn^i[j] 7^ null we would like to add ITn^i[j] to the list at 
AIFj [j + 1 + where niy^ is the length of the subpattern corresponding to Vi, since in j -|- 1 -|- my^ 
time units from now, ii Vi £ R arrives we will want to report all of the edges pointed to by Wu,i[j]. 

If we were to precompute all of the arrays Wu,i the time and space would be 0{lsc - d{f3* — a*)), 
which may be rather large. To reduce this cost we use techniques that we used in the uniform case, 
as follows. 

The construction of our array will make use of the following procedure. Let u and v be two 
vertices in T where u is a proper ancestor of u, and assume that we have already constructed Wy^i. 
In order to construct Wy^i we first initialize all of the entries to null. Next we traverse the vertices 
from n to u by order, and for each vertex w that we encounter on this path, for each headyy^ij, if 
Wudlj] = null then we set Wu,i[j] to be a pointer to headyy^ij. Once we reach v we fill in the null 
entries in Wy^i with their corresponding entries in Wy^i (some of which may also be null). The total 
time cost of this process is the total weight of vertices on this path (notice that every vertex has 
at least 1 head pointer), and another 0 {Vlsc ■ d{(3* — a*)) time for initializing Wy^i and scanning 

Wy,i. 

In order to utilize the construction procedure that was just described, we will maintain arrays 
only for specially chosen vertices in T, such that whenever we need to construct an array 

Ay for some vertex u during the query phase where u is not special, the total weight of vertices on 
the path from u to its closest special ancestor will be 0 {Vlsc ■ d{f3* — a*)), so the total time cost 
for constructing Wy^i will be 0 {Vlsc ■ d{/3* — a*)). 
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Choosing special vertices. As in the uniform case, we partition T into 0{\l 7^) small subtrees 


such that each subtree has total weight Q,{Vlsc ■ d{f3* — a*)), except for possibly the subtree con¬ 
taining the root of T. This guarantees that the total number of small subtrees is The 

special vertices are the roots of these small subtrees. 

The partitioning is obtained by (greedily) peeling small subtrees in the bottom of T, as in the 
uniform case, and costs linear time using a post-order traversal. Also, as in the uniform case, the 
total weight of vertices on the path from u to its closest proper ancestor that is a special vertex is 
0{Vlsc ■ d{f3* — a*)). Computing for all special vertices v and all i is executed using a top- 
down approach, as in the uniform case, and the time to construct the array for each special vertex 

is 0{\/lsc ■ d{/3* — a*)). Since the number of special vertices is the total preprocessing 

phase costs 0{d{/3* — a*)) time. Similarly, the process of constructing Wu,i for a vertex u G L 
that arrived during a query costs 0{^/lsc ■ d{(3* — a*)) time. The rest of the details also follow 
the ideas from the uniform case; Wu,i is temporarily constructed when u arrives, and again after 
l3* — a* + 1 + M time in order to delete the information that should be removed. □ 
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